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THE RELATIVE DISTRIBUTION OF THE REAL ROOTS OF A 
SYSTEM OF POLYNOMIALS* 



C. F. GUMMER 
1. Introduction 

The problem about to be discussed may be looked upon as a generalization 
of the classical problem solved by Sturm in 1829 with regard to the real roots 
of a polynomial. f In Sturm's theorem it is shown how the number of distinct 
roots of a single polynomial which fall within a given real interval may be de- 
termined through a process rational in the coefficients. In the present paper 
we shall study a system of two or more polynomials in a single variable, and our 
aim will be to develop a rational process by which the order of succession 
of the roots of the several polynomials in a given real interval may be dis- 
covered. 

Let us confine ourselves, at least for the present, to the case where the end- 
values of the interval are not roots and where the polynomials have only simple 
roots in the interval. It is clear that in such a case Sturm's theorem determines 
the only relations of the real roots of a single polynomial to the interval that 
remain invariant under continuous transformation of the real number system 
into itself. A similar remark applies to the theory about to be developed 
with respect to the real root system of several polynomials; so that we are 
undertaking the study of a problem which, from the point of view of a one- 
dimensional analysis situs, may be said to be the fundamental problem of the 
system under consideration. 

The greater part of the paper will be devoted to the case of two polynomials. 
If the roots of the first are denoted generally by a and those of the second by j8, 
and if we write down the roots within the interval considered in increasing nu- 
merical order (as aaa(30aPP(3a(3), the |S's effect a certain partition of the as 
(in the present case into groups of 3, 0, 1,0,0, 1, 0,). The solution of the problem 
will consist in the determination of the numbers of a's in the successive groups. 

* Presented to the Society, September 7, 1920. 

fBulletin de Ferussac, vol. 11 (1829); Memoir es des Savants 
e t r a n g e r s, vol. 6 (1835). 
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Previous writers have considered a less precise form of the problem, namely 
that in which a- and /3-groups are counted only modulo 2. Cauchy* in 1837 
or earlier examined the relation of the remainders in the highest common factor 
process for two polynomials to the changes in sign of these polynomials, thus 
obtaining essentially the solution of our problem modulo 2. Sylvester! in 
1853 gave to these relations the interpretation in which we are here interested, 
in connection with the ordinal distribution of the roots. In Sylvester's paper 
the arrangement of a'$ and /3's is called an "intercalation scale." The arrange- 
ment obtained by removing pairs of consecutive a's or /3's as long as any such 
pairs are present is called an "effective intercalation scale." Thus, in the 
above example, the arrangement reduces, by cancellation of the a-pairs, to 
a/3/3a/3/3/3a/3; then, by removal of /3's, to aa/3a/3; and finally the effective inter- 
calation scale is fiafi. This reduced arrangement is connected by Sylvester 
with the signs that the two functions and their Sturmian remainders take at 
the ends of the interval. % A generalization of the theory to the case of n func- 
tions in (n — 1) variables, proposed by Sylvester,! was carried out by Kro- 
necker|| in 1869, in his theory of the "characteristic," which determined, 
also to modulus 2, the relative arrangement of the intersections of one closed 
curve in (n — l)-space with two bypersurfaces. Neither these nor other writers 
appear to have considered the problem of the complete root-distribution, whether 
in one or more variables.^ It is the object of the present paper to obtain a 
solution for the one-dimensional case, to make some generalizations, and to dis- 
cuss a number of special points in connection with the theory. 

The solution of the problem for two polynomials will be found in § 3, an al- 
ternative method being given also, in § 13. In both methods use is made of the 
sequence of Sturmian functions derived from one of the polynomials. In the 
one case these serve as material for a number of additional Sturmian processes. 
In the alternative method they are used in building up a number of quadratic 
forms on whose signatures the solution depends. 

* Calcul des indices des fonctions, Journal de l'Ecole polytechnique, 
vol. 25 (1837), pp. 176-229. Part of the theory was developed in a memoir presented to the 
Turin Academy in 1831. 

t A theory of the syzygetic relations of two rational integral functions, comprising an appli- 
cation to the theory of Sturm's functions and that of the greatest algebraical common measure, Phil- 
osophical Transactions of the Royal Society of London, vol. 
143, part 3 (1853), pp. 407-548; Collected Mathematical Papers, vol. 1, pp. 429-586. 

t Ibid., Art. 48-50; (Collected Mathematical Papers, vol. 1, pp. 517-521). 

§ Ibid., Art. 55; (Collected Mathematical Papers, vol. 1, pp. 527-8). 

1 1 Ueber Systeme von Functionen mehrer Variabeln, Berliner Monatsber- 
richte, 1869, pp. 159-183, 688-697. 

Ueber die Charakteristik von Functionen-Systemen, ibid., 1878, pp. 145-152. 

9 See also Cayley, On the geometrical presentation of Cauchy' s theorems of root-limitation, 
Transactions of Cambridge Philosophical Society, vol. 12, part 
2 (1877), pp. 395-413; Collected Mathematical Papers, vol. 9, pp. 21-39. 
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In § 7 will be found the theory for a system of more than two polynomials. 

In § 8 the simple interval previously used will be replaced by the set of inter- 
vals denned by a system of algebraic inequalities; and in § 9 will be discussed a 
point arising out of this notion, whose chief interest consists in the connection 
that it makes with the restricted theory of Cauchy and Sylvester. 

Since the reader may feel some disappointment at the laborious character 
of the computations demanded in numerical applications of the theory, it seems 
proper to include (in §§5, 10, and 12) a number of remarks tending to mitigate 
this condition. Even so, however, the main interest of the problem must re- 
main theoretical; nor does it seem likely that any method could be devised 
capable of rapid application to numerical cases. 

Among a variety of special problems to which § 11 will be devoted, we may 
notice in particular the determination of the number of separate intervals of the 
real number-system defined by a set of real algebraic inequalities. Two solu- 
tions will be given for this problem (in vii and viii). 

2. The Cauchy-Sylvester theory 

It will be useful to have before us the existing theory of the reduced root- 
distribution in a form that can be readily applied in obtaining a complete theory. 

Let a and b be two real numbers, a < b. Let/ (x) be a polynomial with real 
coefficients, which does not vanish when x = a or b, and which has in the interval 
(a, b) no multiple roots and no roots in common with another given real poly- 
nomial <p(x). Let/i(*:) be the first derivative of fo(x). Let those real roots of 
/o(x) which lie in (a, b) be «i, a 2 , . . . 

Let S(/ , <p) denote the Sturmian sequence \fa(x), ip(x), <p'(x), <p"(x), . . .] 
obtained as follows: ~f>'(x) is the polynomial found as remainder when /o is 
divided by <p, — <p" the remainder when <p is divided by <p', and so on, the last 
function in the sequence being the highest common factor of /o and <p. Let 
V(x) be the number of variations of sign between consecutive elements of the 
sequence 5(/o, <p) for any given x. 

The Cauchy-Sylvester theorem may then be stated in the following form: 
the number of a's making the quotient <p(a)/fi(a) positive exceeds the number of as 
making it negative by V(a) — V(b). 

This excess is called by Cauchy the "index" of / and <p in (a, b). 

It is easy to see that, with a suitable convention as to sign, the index enu- 
merates the a's in the reduced or effective intercalation scale, and it is in such a 
form that Sylvester obtains the theorem.* It is, however, a simple matter to 

* For a physical interpretation of the effective intercalations in terms of two interlacing 
strings, see Sylvester, Sur I'entrelacement d'une fonction par rapport a une autre, Journal 
fur Mathematik, vol. 88 (1880), pp. 1-3; Collected Mathematical Papers, vol. 3, pp. 
449-450. 
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prove the rule in its present shape by studying the continuous variation of x, 
and by observing that (i) the last function of the sequence vanishes for no x 
in (a, b), (ii) when any other function of the sequence, except fo(x), vanishes, the 
two adjacent functions have opposite signs, and (iii) when x = a + 5, fa(a) 
being zero, f (x) /fi(x) has the same sign as 5 for small enough values of | 5 j. 

An immediate corollary of this theorem will be of use to us a little later. 
With the same assumptions, we may replace <p(x) by the product of <p(x) by 
fi(x). Hence the number of a's making <p(a) positive exceeds the number making 
it negative by U(a) — U(b), where U(x) is the number of variations of sign in 

3. Complete theory for two polynomials 

Let a and b be real numbers of which a < b. Let/ (a;) and go{x) be given poly- 
nomials with real coefficients, and/i(x), gi(x) their first derivatives. Let us 
denote the sequence of Sturmian functions S(g , gi) by go(x), gi(x), gz(x), . . ., 
g k (x).* Let us assume that (i) neither a nor b is a root of fo(x), (ii)/ («) has 
no multiple root in (a, b), (iii) fo(x) has no root in (a, b) in common with any 
one of go(x), gi{x), . . ., gk(%)- Suppose the roots of f Q (x) in (a, b) to be « 1? a 2 , 
. . . , and suppose the distinct roots of g (x) in the interval a < x ^ b to be 

Pi, /3 2 f Out object is to show how the partition of the a's by the /3's 

may be determined. 

We naturally inquire how many of the a's give to the sequence g (a), gi(a), 
. . . , gk(a) some particular number r of variations of sign. For, if we knew this 
variation number r for a certain a, we should be able to identify, by using 
Sturm's theorem, the /3-interval in which a was contained. Let N r be the 
number of a's giving to g , gi, . . . , gj exactly r variations. The problem is 
then to determine N r . 

It will be convenient to denote the product g,-_i(#) -g,(*) by G t (x), and the 
product G h (x) -GiSx) . . .G it (x) by G tlil ---u(x), or briefly by G (f) (x). A product 
of g's may in like manner be written g^(x) ; but, for the most part, it will be 
convenient to write formulas in terms of the G's. It will be understood that 
G{x) or g{x), without suffix, represents unity. 

There will be a permanence or a variation between g;_i(a) and gi(a) according 
as Gi(a) is positive or negative (by assumption (iii) it cannot be zero). There- 
fore N r is the number of a's making exactly r of the quantities (Ji(a), . . ., 
Gk(a) negative. 

Let us denote by L h „ •■•' or L w , the number of a's making the particular set 

* Or let go, gi, ■ ■ ■ , gh be any general Sturmian sequence, however obtained, in the sense 
that its loss of variations agrees with that of the sequence defined above. 

t The reader may prefer to assume throughout that go has no multiple roots in the inter- 
val, and that neither a nor b is a root of go. 
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of functions G h , G h , . . . negative, and the remaining Cs positive. Then it is 
evident that N r is equal to the sum of those L's that possess r suffixes; that is 
to say, No = L, Ni = X]L,-> ■ ■ ■ , N k = L i2 . . . k . 

In order to find the L's, let us connect them with another set of integers which 
it will be possible to determine directly. Let M hh . . , or M (>) , be the excess 
of the number of as making the product G hh . . . positive over the number 
making it negative. In particular, M (without suffix) is the number of roots 
of fa{%) in the interval. It will also be understood that m (i) has the same mean- 
ing with reference to g (l) . 

It is clear that a particular a contributes +1 or — 1 to M hh ... according as 
it gives negative values to an even or an odd number of the functions G h , C,,, 
. . . whose suffixes appear in the symbol M ilh ... . If then we denote by { (i), (/) } 
the number of elements common to the two sets of integers (i) and (j), we find 
that the 2 k integers M (j) are connected with the 2 non-negative integers L w 
by means of the system of 2 equations 

(1) £(-!)<«■ W »L W =M „ 



where the summation covers all combinations of i's (including the null-set) 
and there is one such equation for each combination of /'s. 
To solve (1) for the L's we notice that, whenever (?) and (i') are not the same 

set 

y~Y— i)<W' w) >(— i)K'">, o)) _ o. 

For, if e is some element present in one but not both of (i) and (i'), the sum on 
the left side is composed of pairs of equal and opposite terms differing only in 
the presence or absence of e in (/). 

Hence, on multiplying (1) by (— l) 1(l } ' ' and adding, we have, after 
writing (t) for (i'), 

(2) L (0 = 2-*V(-l) |W '°' )l itf<;>- 

The problem is now reduced to the determination of the M's; and for this we 
have the method of §2 available. Denoting by H a) (x) the product G^x) •- 
fi(x), let us form by the Sturmian division process the sequence S[fo, H^y] = 
[/o, fly), fl'o> fl*0)> • • • ]• K V(/)(*) represents the number of variations 
of sign in the sequence for any given x, the required formula is, by the corollary 
in §2, 

(3) M y) = V a) (a) - V 0) (b). 

An alternative process for finding M^ will be given in §13. 
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We now have a complete chain of relations leading to the values of N r for 
r = 0, 1, . . . , k. These relations contain the solution of the problem for two 
polynomials as far as the relative distribution of the as and the intervening 
/5's are concerned. They do not disclose the number of outlying 8's at either end 
of the interval. These numbers, however, are easily found by observing the 
values of go, g\, ■ ■ ■ when x = a and b. If we introduce the fresh assumption 
(iv) that neither a nor b is a multiple root of go, and if the sequence go, gi, . . . , 
g k shows k variations for x = a and X variations for x = b, while k' and X' are 
the highest and lowest values of r for which N r ^ 0, it is clear that the numbers 
of marginal /3's are k - k' and X' — X, on the understanding that the interval 
includes b but not a. 

4. The distribution function 

We are interested, not in the individual numbers L w , but in the sum of those 
among them that involve r suffixes, since this sum is equal to N r . It will be 
seen that the N's are not capable of simple expression in terms of the M's. 
For some purposes the most convenient way of exhibiting the relation between 
the N's and M's is by means of a generating function 

E(t) = 2>/. 

r - 

On writing N r in terms of Hie L's and then eliminating the latter by means of 
(2), we find that 

(4) E(t) = 2~*£ £(-l)<»' °»M /, 

CO (0 

where, in any term of the sum, r is the number of elements in (i). 

Now it is not difficult to see that, for any particular set (/), 5 in number, 

(5) Z)(-i) {(0 - U)] f = (i + t) k -\\ - ty. 

(0 
For the linear factors of the right member may be arranged in such order that 
the /th factor is (1 — t) for every / belonging to (/)• The general term of the 
product (if like terms are not collected) is then the positive or negative product 
of a number of t's drawn from the iith, 4th, . . . factors, the number of negative 
factors in this term being {(i), (/)}• The complete product contains one 
such term for each possible combination (i), which shows that (5) is correct. 
By means of (5), (4) may be reduced to the form 

E(t) = 2-*X)m u) u + o*- J (i - ty, 

V) 
s being the number of elements in (/). 
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If P s stands for the sum of those M's having s suffixes, we have finally the 
identity in t 

(6) !>/ = E(t) = 2- k jy s {i + o* _, (i - ty. 

r = s = 

The P's being known, the expansion of the right member in descending powers 
of t will give, by means of its coefficients, the numbers of a's in the successive 
/3-intervals from left to right. We may call E(t) the "distribution function." 

Since the P's are formed from the M's in the same way as the N's from the 
L's, a reference to equations (1) and (2) will convince us that, corresponding 
to (6), there is another identity (in u) 

k k 

(7) J>y = XX(1 + u) k - r (l - u) r . 

s = r = 

In fact (7) may be obtained from (6) by the transformation t = (1 — «)/(l + u). 
To illustrate the occasional advantage of the reciprocal formula (7) over (6), 
let us write the conditions that the root-distribution should be PPaafi, taking 
k = 3. It is required that /]j*ji = 2(1 + u) 2 (1 — u), so that the condi- 
tions are P = 2, P% = 2, P 2 = — 2, P 3 = —2. If a pair of polynomials does 
not satisfy these conditions, the fact is likely to be discovered without computing 
all the M's. On the other hand, the use of the f-identity (6) gives four conditions 
each of which involves all the eight M's. 

5. Remarks on the preceding theory 

(i) The relations expressed in terms of the G's in equations (1), (2), (3), 
and (6) may also be put in terms of the g's. Equations (1), (2), and (3) will 
merely exchange the capitals for small letters (to be understood in a corre- 
sponding sense), except that the factor 2~ will be replaced by 2 ; but (6) 
will go into a less simple form, and it is therefore convenient to keep the for- 
mulas in terms of the capital letters. In numerical work, however, the calcu- 
lations will actually be made, as the following remark will show, with functions 
of lower degree than would appear from the general formulas. 

(ii) In computing My), we have evidently the right to substitute for (7 0) 
or H(j) any polynomial assuming the same signs for all the as. For example, 
M m = tHo3', hence H m may be replaced by hos = gog3/i- 

We may also subtract from any G^ or H y) the product of any polynomial 
by /o, and the residue so obtained may be used in place of the original function, 
not only for the calculation of M y) , but also in the formation of higher prod- 
ucts. 
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(iii) When some M (() is equal to =*= M, every M 0) calculated will at the same 
time furnish the value of some other. For, if (t ') and (i") together make up 
(t)> and if (/) is any set not overlapping (*), we know by (ii) that M^> tj ) = 
=±= M «»,», the ambiguity being in agreement with the former, where (t, /) means 
the set of elements belonging to either (i) or (/). 

In particular, if it is known that M { = =±= M for every i in some set (e), it is 
no longer necessary to calculate independently any M W) whose set of suffixes 
has elements in common with (e). 

These ideas will be generalized in §10. 

(iv) In certain cases, as when g is known to have all its roots real and dis- 
cinct, go and its successive derivatives will serve as the sequence (go, gi, . . . , 
g k ). In any case the Budan-Fourier theorem shows that the root-distribution 
indicated by the use of the derivatives will differ from the true distribution at 
most by the presence of pairs of consecutive /3's. It will be exact if no such pairs 
are indicated. 

(v) Failure of any of the hypotheses (i), (ii), or (iii) of §3 will always lead 
to reducibility of /u, if the degree of f is not less than that of go. 

6. Numerical examples 

In the folio #ing examples, substitutions in accordance with §5 (ii) are freely 
made without a corresponding change of notation. 

(i) Let/,(*) = *» + 3* 2 - 3% - 1, g (x) = x 2 -ix-7. 

Then /i = x 2 + 2% - 1; g, = * - 2, g 2 = 1. Therefore & = -9* 2 
+ Ax + 15, G 2 = * - 2, G12 = go = x 2 - Ax - 7. Hence 

H=x*+2x-l, Hy 17x 2 + 28x-l, H 2 3x 2 -2x + S, H 12 =x 2 -12x+l, 

H' = *, H\ = -83* + 23, H', = 8x - 3, H' a = -11* + 1 

if" = i, H"i = -1, tf" 2 = -1, ff'n = 1. 

V(- oo) = 3, Fi(- °o) = 2, V«(- oo) = 0, Vn(~ ») = 1, 

V(0) = 1, Fi(0) = 2, F 2 (0) = 2, Fi 2 (0) = 1, 

l/(oo) = 0, Vi(oo) = 1, y 2 (oo)=3, Vu(oo) = 2. 

If the interval is ( — oo , oo ), 

M = 3, Mi = 1, M 2 = —3, M12 = — 1. 

Hence E{t) = ,{3(1 + 2 + (1 + 0(1 ~ - 3(1 + (1- - (1- <) 2 } 
= t 2 + 2f. Also K = 2 and X = 0; so that the arrangement is afiaafi. 
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Tf the interval is (— oo , 0), 

M = 2, Mi = 0, M 2 = -2, M 12 = 0. 

Therefore E(t) = J {2(1 + t) 2 - 2(1 — t 2 )) = t 2 + *■ Moreover, X is now 1, 
showing that there are no (3's on the upper side, and the arrangement is a@a. 

In both cases Mu might have been found as the negative of M u as explained 
in §5 (iii). 

(ii) Let f = x 2 - 3x - 1, g = x 3 - 3* 2 - 3x - 1 ; a = - x , b = oo . 

We find that M = 2, iltfi = 0, M 2 = 0, M 12 = -2, M 3 = -2; and it 
follows that Mi 3 = -Mi = 0, M i3 = -Af 2 = 0, M m = -M 3 = 2. 

Hence £(f) = |{2(1 + *) 3 - 2(1 + t) 2 {l - t) - 2(1 + 0(1 - 2 
+ 2(1 — 3 } = 2* 2 . Since k = 2 and X = 1, the order of roots is aaB. 

7. More than two polynomials 

It is always possible to find the relative arrangement of roots for a system of 
polynomials by applying the method of §§3 and 4 to the several pairs. It may, 
however, be interesting to exhibit the solution of the problem by means of a 
single formula analogous to (6). 

Let one of the polynomials be/ (:K), having roots a u a 2 , ... in (a, b) ; and let 
the other polynomials (in finite number) be g'o, g"o, .... Let all the preceding 
notation be applied, with accents wherever necessary; and further let the prod- 
uct G' a^G" ({»•) ... be abbreviated to G^y* 

It will be assumed that neither a nor b is a root of / , and that/ has in (a, b) 
no multiple root or root in common with any of the Sturmian functions g' , 

g 1, g 2, ■ • • . g k'< g o» • • • > • • • • 

Let N r > r > ... or AT [r] be the number of a's lying in that ^-interval in which 
the sequence g'o, . . . , g V has r' variations, g"o, . . . , g'V has r" variations, 
and so on. By reasoning similar to that of §§3 and 4, we find that iV [r ] is the 
coefficient of t" t"' ... in the generating function 

(8) E{t\t\...)=2- k '- k "----j^ m (i+t r ) k '- s \i-tr'(i+t) k "- s "(i-n s \.-, 

[0)1 

where s' is the number of elements in (/'), s" the number in (/"), etc. 

It is evident that the remarks made in §5 apply without essential modification 
to the present case. 



* It will be noted that [ ] is the symbol of a permutation, ( ) of a combination. 
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8. Generalization of the range 

Hitherto we have taken account of all the roots of /o that lie in a given real 
interval (a, b). Let us now consider how formula (8) or (6) must be modified 
when only those roots are counted which, besides lying in (a, b), satisfy a 
given system of algebraic inequalities. 

Let us have then, in addition to the functions already assumed, a system 
of n auxiliary polynomials G 1 (x), G 2 (x), . . ., G n (x) without roots in common 
with/o, and let the inequalities in question be 

6* < f or every i in a certain set (e); 
G* > for the remaining i's. 

Let the a's satisfying these inequalities be denoted generally by a. 

Let G[o)] denote the product of all the G's whose indices (upper or lower) 
appear in the symbol, and let us define in connection with this product the cor- 
responding quantities of types M, H, and V. Our object being to find N[ r ], 
the number of a's in a particular /3-interval, we shall study the generating func- 
tion E(t\ t", •••) = JjJlrf't'""... ■ 

Let us denote by E w (t', t", . . .) the function 

2-*'-*"- • • -X>f&)](l + t>) k '-°'(i - t')s'. . ., 

[0)1 

obtained from E(t', t", . . .) by affixing the set of upper indices (i) to every M , 
observing that E W is not in general the generating function of a distribution 
and that some of its coefficients may be negative. It will be called a "modified 
distribution function." "There is clearly a set of relations, analogous with (l), 

(9) E(- 1)« W ' W, I = £ ( *\ 

where the summation is for all combinations (e) out of 1,2, . . . ,n, and where 
the E in any term is formed according to the inequalities associated with the 
(e) in question. From these we obtain, as (2) was obtained, 

(10) E = 2-"X;(-l) !W ' ( * )1 -E ( ' ) - 

(0 

The M's appearing in £ (,) being given by the formula 

(11) M% } = V$),(a) - Vf8,j(&). 
the function E is determined. 
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In practice we can of course arrange to have all inequalities of the form G' 
> 0, when (10) reduces to E = 2~"^ w £ (0 . 

There are two ways in which, starting from the range ( — oo , oo ), we may arrive 
at the interval (a, b) by the use of (10). (i) We may use the two inequalities 
G l = x - a > 0, G 2 = x - b<0, obtaining E = \{E + E l - E 2 - £ 12 ). 
(ii) We may use the single inequality G 1 = (x — a)(x — b) < 0, obtaining 
E = \{E — JE 1 )- Of these, (ii) still gives correct results when a ^ 6, if we 
agree that (a, b) and (b, a) are the same interval; while (i) is correct in this case 
if by (a, 6) we mean the interval a < x < b. In the method of §3 it is necessary 
that a ^ b. 

9. THE MODIFIED DISTRIBUTION FUNCTION 

It is interesting to examine a little the meaning of the modified function 
E , at least in a simple case. Without being a true distribution function, it 
has some relation to the distribution, as equation (9) shows. 

Let us consider the case of two polynomials / and go with a single auxiliary 
function G 1 , and denote now by E + the generating function of the root-distri- 
bution in that part of (a, b) where G 1 is positive, and by E~ the same for G 1 
negative. With E 1 as already defined (as a special case of E ), we have, by 
(9), E = E+ + E- and E 1 = E+ - E~; so that E - 2E~ = E l = -E 
+ 2E+. 

It follows that the general coefficient AfJ in E 1 does not exceed in absolute 
value the corresponding N r , and differs from it by an even number. In parti- 
cular, |iVj| will correctly enumerate the as in the /3-interval whenever either 
N~$ or iV"7 is zero; that is, in any /3-interval where the a's are not separated by 
a root of G 1 . 

The information given by E l is complete when /JiV*| = M. In other 
cases the total error in the enumeration of the a-groups by E 1 is M — / ANl\. 

Two special cases may be noted. 

(i) If G 1 = go, all the a-groups will remain undivided, and E 1 will give exact 
information. In fact, for this case, if go has no roots in (a, b) of even multi- 
plicity, EH') = *£(-*)• 

(ii) On taking G l = go/i, we find that the function H w is replaced by goG^y 
Every N\ is now or ± 1, according as N r is even or odd. The a-groups have 
therefore been reduced modulo 2. To arrive at the Cauchy-Sylvester theory, 
we must reduce both a- and /3-groups, which is accomplished by making t = 1. 
In fact, £ x (l) is equal to M 1 , which is the loss of variations in the sequence 
5(/o, go) ; in other words, the Cauchy index. 
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10. The M-INEOUAUTIES 

Although the M w are independent quantities, in the sense that they are 
connected by no identical equation, they satisfy a number of linear inequalities, 
which, in many cases, enable us to dispense with a part of the work in computing 
them. These inequalities express the fact that the L w are essentially «£ 0, 
so that they take the form 

(12) ^£(-l) {w - U)] M u) ^0. 

CO 

On account of these relations, a single equation in the M's may be equivalent 
to a number of equations. For example, the equation 22 s P s (k — 2s) = 0, 
being the same as /,;Li = 0, breaks up into the k equations L t = (i — 1, 
. . ., k), which again may be written in terms of the M's. 

For the purpose of direct numerical application of the theory, the most im- 
portant consequences of (12) are contained in the following theorem. 

If, for some set (e) contained in a second given set (d), 

(A) £(-l) !w - wl M (i) = 0, 

(i) in « 

it will follow that, for every (;') without elements in common with (d), 

(B) E(-1) {(0 ' W) M ( .-,-) = 0, 

(<) in (<0 

where (i, j) is the set whose elements are those of (i) and those of (/). The 
summation is for all subsets (j) of (d). The proof is that (A) expresses the 
vanishing of the sum of those L (e r) for which (r) has no elements in common 
with (d). These L (< , ( r) must separately vanish. Hence 

E(-l) ,w - w, L fcr ,-0. 

which reduces to (B). 

It is natural to compute the M's in some such order as M, M,-, Mj, M,>-, M P , 
M {p , M jp , M ijP , .... If some M (i) is found, with preceding M's,to satisfy 
a relation of type (A), all the M's whose suffixes include (d) are given by 
equations of type (B) in terms of M's already found. The smaller the set (d), 
the more numerous are the consequences. When (d) has only one element, 
we get the rule of §5 (iii) with respect to the condition M 4 - = = M, and the 
values of one half of the M's are given in terms of the others. If there are 
two elements in (d), (A) will have one of three possible forms; for instance, 
if (d) is 1, 3, and (e) is 3, (A) becomes M + Mi - M 3 — M i3 = 0, and (B) is 
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of the form M(/>) + -M"i(#) - Msit) — Mn(p) = 0, which gives a quarter of the 
M's, namely those of the form MnW- 

This theorem remains true when (;') is any set, and also when the individual 
suffixes are replaced by sets of suffixes, provided we understand by (t, j) the set 
of all elements occurring in one but not both of (i) and (/)• Thus, from M 
+ Mjs + Mn + M 23 = we deduce M u + M 24 + M 34 + Mm* = 0. 

A reason for the frequent intervention of this principle in shortening numerical 
work is in the limited choice of values for M (d) , which narrows as the number of 
suffixes increases. Since /,M ffl must be divisible by 2 s , where q is the 

(«') in (<0 

number of elements in (d), M (d) is known beforehand modulo 2 q , and this 
alone will enable us to foretell the value of M w if 2* > 2M. 

There is no difficulty in extending the principle of this section to the more 
general methods of §§7 and 8. 

11. Application to special problems 

In the following applications (i) to (vi) the reference is to two polynomials in a 
single interval (finite or infinite). 

(i) The question whether all the a's are to be found in a particular set of 
/3-intervals reduces to the question of the linear dependence of / tS P,u on 
(1 + u) k ~ n (1 - uf 1 , (1 + u) k ~" 2 (l - m)' 2 , . . . ; but in particular cases, 
the conditions may take various forms through the principle discussed in §10. 

Thus (under assumption (iv) of §3) the fi's will be entirely unseparated by the 
a's if and only if the sum vanishes of all L's with more than X and less than k 
suffixes, which reduces to a single equation in the M's. 

In particular, if all the roots of go are simple and interior to (a, b), so that 
k = k and X = 0, a necessary and sufficient condition that the a's do not separate 
them is thatM - L - L n . . . k = 0; which reduces, first to "%2[M - M w ] 

(i) even 

= 0, and finally to the condition that all M (l) with even numbers of suffixes 
are equal. By §5 (iii), it is necessary (though not sufficient) that the M w 
with odd numbers of suffixes are also equal. When the roots are so arranged, 
the manner of partition is completely determined by N = h(M + Mi), Nf, 
= \(M - M0. 

(ii) A condition for k /3's, each less than any one of the a's, is M { = M for 
every i. 
For k |8's, each greater than any a,M t = —M for every *'.* 
(iii) For the a's to occur in alternate ^-intervals only, P s = eP k _ s for 
every s, where e is either 1 or — 1 ; which, by §5 (iii), is equivalent to the one con- 

* See also the note to (vi). 
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dition M = ± w o4 . This condition may also be obtained from elementary 
considerations. 

(iv) The partition of the as is ordinally symmetric about the position where 
the gj's have k/2 variations if and only if P r — for every odd r. 

(v) To have an arrangement consisting of (n + 1) groups of r a's each, 
alternating with n groups of s /3's each, where ns = k, it is necessary and suffi- 
cient that 

r l(i + u) ns + (i + M ) (M - 1,s (i - u y+ • ■ • + a - u) ns \ =* p„ + Pi« + • • • + p*»\ 

It is therefore necessary that P t = for every odd i. 

If 5 is even, the equations of condition reduce to one more than half their origi- 
nal number, owing to the inclusion of the equation M — M 12 . . . k, and involve 
only one more than half the M's. For instance, (taking n = r = s = 2) the roots 
of a sextic and a quartic occur alternately in pairs whenever M = Muu = 6, 
M a + M u + M 2S = 10, and Mi + M 2 + M 3 + M 4 = 0. 

When r = s = 1, the conditions are P 2l = ( 2> - + J), P 2 ,- + i = 0; but the 
following method is simpler. 

(vi) Let go have no multiple roots in (a, b). The modified process with the 
auxiliary function G x = go/i shows that a necessary and sufficient condition for 
k (or k + 1) a's alternating with k j3's is M = fe (or k '+ 1), M 1 = ±M. 
These are Sylvester's results, and represent the case where his restricted theory 
gives complete information. The tests involve very little calculation. It is 
easy to see that k may here be interpreted as the number of /8's*. 

(vii) The theory of the generalized range in §8 enables us to find rationally 
the number of separate intervals of the real number system satisfying a set of 
algebraic inequalities 

(A) G > {i = 1, 2, . . . , «), 

the C being real polynomials in x without common or multiple roots in the range 
(—00, oo). 

To complete the system of §8, let/ denote the first derivative of the product 
G^G 2 . . .G", and suppose that/ has no real multiple root. Let go = 1- Since 
k = 0, the ^-functions reduce to constants (the corresponding M's). Let us 
introduce another auxiliary function G° = f%, and let M+ denote the number of 
real roots of / making G° positive and satisfying (A), M~ the number making 
G° negative and satisfying (A). 

* The result of (vi) may be used to prove the following theorem. Let /< and g% be the tth 
derivatives of /o and go, and let m be the loss of variations between a and b in the sequence 
S(fi-igi, f*gi-i). Let pi be the degree of fi-igi, and q the degree of the lower (or both) of 
/<, and go- A necessary and sufficient condition that the roots of / and go may be all real and 
simple and in (a, 6), with the least root of /o greater than the greatest root of go, is that 
it; = pi (i = 1, ...,g). 
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Equation (10) degenerates in the two respective cases to 

M+ = 2-"- l Y,{M w + ikf°»}, M~ = 2-"- 1 J2{M w - M°w}, 
tf) («) 

(i) denoting in turn every subset of 1, 2, . . ., n. A little consideration then 

shows that the number of separate finite intervals is M~ — M + , that is 

— 2~" ]/J(/)M oW . The question of infinite regions may be settled by means 

of the signs of the coefficients of the highest terms of the G's. 

It will be seen that, as a result of adjoining the derivative of /o, we may re- 
move a square factor from i? 0(,) , so that M oW is the number of variations lost 
between — oo and oo by the sequence 5(/o, G ), and G° does not actually 
enter into the calculations. 

As an example, let G l = 1 - x 2 , G 2 = x 2 - 2x (» = 2). Then/ = -2# 3 
+ 3* 2 + x - 1 ; M ° = loss of variations in S(/ , 1) = - 1 = - 1 ; M 01 = loss 
of variations in 5(/ , G 1 ) = 2 - 1 = 1; M 02 = -1; M 012 = -3. Hence the 
number of finite intervals is — 1( — 1 + 1 — 1 — 3) = 1. There is no infinite 
region. 

(viii) A second method for the problem of (vii) involves generally more 

sequences, but simpler ones, and allows / to have real multiple roots. Let 

iM be the value of M {J when G* is used in the place of /o, (/) n °t containing i. 

Let I be the number of infinite regions satisfying (A), as found from the signs 

of the highest terms. It may be shown then that the number of separate regions, 

finite and infinite, defined by (A) is 2~"^ Z) M® + J- 

» 0) 
Thus, if G 1 = x 2 - Qx + 8, G 2 = x 2 + 6x + 8, and G 3 = * (» = 3), we shall 

find that xikf = t M 2 = iM 3 = jM 23 = 2, zM = 2 M 1 = 2, 2 M 3 = 2 M 13 = -2, 

3 M = 3M 1 = 3 M 2 = 3 M 12 = 1 ; and the functions are all positive to the right, 

but not to the left. Hence the number of regions is |(6X2 —2X2 + 4X1) 

+ i— 2. 

12. Restricted distributions 

It will sometimes happen that, before applying the general theory, we have 
a partial knowledge of the root-distribution which enables us to assume for E 
an expression of the type e + 22i " i c i e i> where the coefficients c t are inde- 
pendent of t (or of t', t", . . .). and fewer than k + 1 in number, and e, e t are 
known functions of t(or of t', t", . . .). In such a case it is not usually necessary 
to use all the M's in determining the distribution. If, starting with the 
equation 

e + 2>< = 2"*5>,(1 + t) k ~ s (l - ty, 

i s 

we differentiate successively, and then put t in turn equal to 1 and —1, we get 
2k + 2 equations which, if the e { are linearly independent, determine the n 
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constants C(. From these equations we may select any n whose determinant 
is not zero, and we shall naturally do so in such a way as to use as few M's as 
possible, noting that, when t = 1, the rth derivative of E involves no M with 
more than r suffixes, and that, when t = — 1, it involves none with fewer than 
k - r. 

Let &n'.'.'.*"n-r be the w-rowed determinant whose first v rows consist of 
the derivatives of order pi, p 2 , . . ., p„ of the e it with t = 1, and whose last 
n — v rows consist of derivatives of order a x , <?2, ■ • •> <* n -v> with t = - 1. 
Let us examine this determinant in some special cases. The problems most 
naturally presenting themselves are those in which e f is of the form f'(l + t)" 
(1 — t)"<p(t) (i = 1, . . ., n), where <p(t) is not divisible by 1 + t or 1 — t. 
Let fi of the r,- be even and n - n of them odd.' It may then be shown that 
Aff',£ + \\\\ \\ *<, \ n-l-i differs from zero when v = or n, as also when the smaller 
of v and n — v is equal to the smaller of ju and w — ju, but is equal to zero when 
the smaller of v and n — v is greater than the smaller of n and n — fi. 

For example, whenever k — X is less than k, we have an instance with p = 
<r = and <p(0 = 1. If then go has only two roots in (a, b), the positions of the 
as with respect to them may be found by means of P , Pi, and Pi) P k , P k -i> 

Pk-t', Pq> Pu Pk'< or Po> Pk> Pk-v 

Another example is afforded by the case where the distributions of the a's 
and the jS's with respect to some special set of numbers are already known, so 
that their mutual arrangement is known subject to certain transpositions of 
adjacent roots. In such a case p = 1, u = 0, and <p(t) = 1. Thus, the arrange- 
ment being (aj8)(aa/3j8)j8(aaj3), with the order in each bracketed group unde- 
termined, we have a case where n and n — n are 1 and 3 (or 3 and 1), and the 
complete determination is made by means of Pi, P k , Pk-i, Pk-v Du t. not 
by means of Pi, Pi, P k , Pk-v 

A second type of restricted distribution has its generating function of the 
form e + <p(t) • 22 " i c /*> where now only the exponents r t are unknown. 
If <p(l) is not zero, we find, by successive differentiation and putting t = 1, 
^ftf in terms of P , Pi, . . • » Pj for / = 1, 2, . . . . The first n of the resulting 
equations must, if the c { are all different from zero, be solvable rationally in the 
n unknown r it though possibly not without ambiguity. Thus, if the first two 
equations so obtained are 2n + r 2 = 12, 2r\ + r\ = 54, either n = 3 and 
r 2 = 6, or n = 5 and r 2 = 2. The value of 2r\ + r\ will decide. 

Generally, it may be proved that, if the first 2n equations are used, there 
cannot be two solutions in non-negative r's unless they are mere permutations 
of one another determining the same distribution.* 

When the coefficients c { are all positive, the n equations 2l> " iC-ij = Cj 

* This can be proved from Descartes' rule by making the transformation / = log y, log rt 

«= pi. 
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(j = 1, 2, . . . , n) do not admit more than one solution satisfying the condi- 
tions n ^ ri g • • • ^ r„ ^ 0.* Thus, in the example already used, if the 
a's are known to fall into a group of two to the left and a single one to the right, 
the fact that these are in the intervals of five and two variations (three and 
six being now excluded) will be discovered with the aid of P , Pi, and Pi only. 

In particular, if the c,- are all unity, the first n equations obtained will always 
give a unique solution of the problem. Clearly the c's may always be taken as 
unity, by using enough of them. 

There are two conspicuous cases to which this method applies. The first is 
that in which M < k + 1. We may then take E(t) = ]T\ " /»', and the 
values of Pq, P u . . ., P M will suffice. 

The other case is that of the relative distribution of the roots of a polynomial 
and those of its derivative over the range (-00,00). If g = f u E is of the 
form ]T\ * x ** — (1 + py].- " 1 t r, ' l and the present method gives k — X + 1 - 2m 
= Pa, £),. * yi - 2%2f { ~ n = kP /2 - Pi/2,etc. The first equation gives n, 
and the next n equations give in succession 2j,r,-, 2_/' r «' 2 > • • • » ]C« r »" ^ n terms °f Po, 
Pi, . . . , P„, thus determining the positions of the k - X turning-points of / 
with respect to the roots. 

If in this problem the range is not ( — 00 , 00 ) but (a, b), the formula for E(t) 

isZUV " (1 + ]£:/' - Hi - sgn\Mb)Mb)]}t X - !{l + 
sgn [/o(a)/i(a)]}f, the theory being otherwise the same. 

13. Method by quadratic forms 

An alternative method for the determination of M[y)j depends on the 
theory of real quadratic forms, and corresponds to Hermite's theorem on a single 
polynomial, as the previous method corresponds to that of Sturm. 

Let £1, £2, . . . , % t be all the roots of /o, real and imaginary, counted each as 
often as its multiplicity requires. We shall suppose still that assumptions 
(i), (ii), and (iii) of §3 hold good. 

By §8, it will be sufficient to consider the case where the range is ( — 00, oc), 
and for this range we know, by Hermite's principle, f that Af[$)j is the excess 
of positive over negative coefficients in any sum of real square terms to which a 
real linear transformation on the T's will reduce the quadratic form 



t^i^-jS^} 2 } 



* This follows in a similar way from a theorem by Laguerre. See Acta Mathe- 
m a t i c a, vol. 4 (1884), pp. 111-114. 

t Sur le nombre des ratines d'une equation algtbrique comprises entre des limites donnees, 
Extrait d'une lettre a M. Borchardt, Journal ftirMathematik, vol. 52 (1856), 
p. 39. 
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The matrix of this form is ( ]j£f/ + 'G$ ](&)), . ... /) _ 1 

\r- 1 /< = o',...', i>-l • 

If any sequence of principal minors of this matrix is selected (as it may always be) 

such that each minor contains the preceding and no two vanish in succession, 

and if W\ [yj j is the number of variations of sign in the sequence, we have the 

formula 

(13) M t % = P -2 W$ } . 

In numerical work by this method, a large part of the labor will occur in evalu- 
ating the determinants. As to the determination of the elements of the matrix, 
the theory of partial fractions provides a convenient method. For the element 
in the (s + l)th row and (t + l)th column is the coefficient of %~ s ~'~ in the 
expansion of /7[y)](x)//o(x) in descending powers of x. Since #[%] 
is the product of /i by a number of G's, all the necessary expansions may be 
obtained by multiplications from the single expansion representing fi(%)/fo(x). 
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